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Jets plus missing transverse energy is one of the main search channels for new physics at the 
LHC. A major limitation lies in our understanding of QCD backgrounds. Using jet merging we can 
describe the number of jets in typical background channels in terms of a staircase scaling, including 
theory uncertainties. The scaling parameter depends on the particles in the final state and on cuts 
applied. Measuring the staircase scaling will allow us to also predict the effective mass for Standard 
Model backgrounds. Based on both observables we propose an analysis strategy avoiding model 
specific cuts which returns information about the color charge and the mass scale of the underlying 
new physics. 

I. JETS WITH MISSING ENERGY 

Missing transverse energy is a general signature for dark matter related new physics at hadron colliders fTl. It has 
a long history at the Tevatron and to date gives the strongest bounds on squark and gluino masses in supersymmetric 
extensions of the Standard Model. At the LHC the first new exclusion limits for squarks and gluinos have recently 
appeared, in the CMSSM toy model as well as in a more general setup [5H1]. All of these analyses are based on jets 
plus missing energy including a lepton veto which constitutes the most generic search strategy for strongly interacting 
new particles decaying into a weakly or super- weakly interacting dark matter candidate |1] l5j . 

While the first results are based on very inclusive cuts, following the ATLAS 6 and CMS documentations we 
expect more specific analyses to appear soon. The reason is that in their current form the analyses can and should 
be optimized for specific new physics mass spectra. More specialized analyses for jets plus missing energy rely on 
a missing transverse momentum cut and on a certain number of staggered jet transverse momentum cuts jHl [Z]- 
Unfortunately, they are therefore hard to adapt to modified mass spectra and by definition show a poor performance 
for not optimized model parameters. In addition, they are counting experiments in certain phase-space regions, which 
means that for any additional information on the physics behind an anomaly we have to wait for a dedicated analysis. 

A major problem of searches for new physics in pure QCD plus missing energy final states is the prediction of 
background distributions. Aside from the improved signal-to-background ratio this is one of the reasons why applying 
fairly restrictive cuts on the number of jets and on their transverse momentum is a promising strategy. Such cuts 
relieve us from having to understand the complete px spectra [5] of general exclusive or inclusive njots-jet events 
at the LHC. Experimentally, however, we should by now be in a position to simulate these distributions using the 
Ckkw [SKin] or Mlm [IT] matching methods implemented in Sherpa [T2j, Alpgen [T3], or MadEvent [14^. The 
different approaches have been compared in some detail, for example for VF-|-jets production [T21[T2]. What is still 
missing is a systematic study of theory uncertainties in multi-jet background simulations for top quark analyses and 
new physics searches, i.e. including large jet multiplicities down to intermediate jet transverse momenta, but reflecting 
a well defined hard scale given by the signal process. Motivated by theoretical and statistical considerations we define 
all observables as exclusive, specifically in the number of jets. 

In this paper we establish a proper simulation of multi-jet processes and estimate these theory uncertainties, with 
a focus on the question what actually constitutes the theory error. This way LHC data in control regions can be used 
to understand very generic scaling features {staircase scaling* ) which have already been observed in data [T51 [12] and 
which we can extend based on appropriate Monte Carlo studies. This staircase scaling we can reproduce and study 
using QCD Monte Carlo simulations, including different hard processes and the effects of cuts. Combining these 
simulations with LHC data should give us a quantitative handle on multi-jet rates in many applications. 

Moreover, we can use our knowledge about the exclusive rijots distributions to predict other notoriously difficult 
multi-jet observables. So once we understand the uncertainties on the multi-jet spectra we turn to the effective 
mass. In its many incarnations it either includes the leading jet or it does not and is either limited to four jets 
or any other number of jets [10]. Obviously, any specific definition of this mass variable increases its sensitivity 
to theory uncertainties. We study the most generic definition of the effective mass including all jets visible above 



* Staircase scaling for jet rates is often referred to as Berends scaling. However, to our best knowledge it was first introduced and discussed 
by the authors of Ref . |17| . 
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a transverse momentum threshold. The theory uncertainties of this observable can be closely linked to the jet- 
multiplicity distribution. Using the scaling properties of the exclusive jet multiplicities we can strongly reduce the 
theory uncertainty in the effective-mass spectrum in a consistent manner. The same should be true for other variables 
which we can use to extract new physics from jet dominated backgrounds. 

Similar questions are currently being asked to control regions in a purely data-driven approach. However, the 
conversion from background regions into the signal region either by shifting the kinematic regime or by changing the 
hard core processes off which we radiate jets requires a good understanding for example of the effects of background 
rejection cuts and of background sculpting features in the definition of these observables. These effects we can reliably 
estimate in an appropriate Monte Carlo study and then combine for example with an over-all normalization from 
data. 

Finally, we suggest an analysis strategy which on the one hand makes maximum use of the jet patterns and on 
the other hand does not require any tuning of cuts. The only ingredient of our analysis which does not involve jets 
is a missing transverse energy cut to reduce pure QCD backgrounds and an isolated lepton veto against H^-|-jets 
backgrounds. To reduce both of them to a manageable level we require 

i)T > 100 GeV and a lepton veto if pT,i > 20 GeV, \ye\ < 2.5 (1) 

as the basic and only electroweak cuts to reduce the QCD background. The exact numbers are not very dependent 
on the details of the model as long as the new physics sector provides a WIMP dark matter candidate. To account for 
fake missing energy from QCD jets we apply an additional factor of 1/500 for pure QCD and hadronic top-quark final 
states. This rough fake rate we estimate from Ref. It provides us with a rather conservative estimate compared 
e.g. to Ref. [f . 

After these very generic acceptance cuts a two-dimensional correlation of the effective mass vs the exclusive jet 
multiplicity is the appropriate distribution to extract limits on strongly interacting new physics or in the case of an 
excess study the mass scale as well as the color charge of the new states. Because all our observables are defined 
jet-exclusively we can to a good approximation study this two-dimensional distribution using a log-likelihood shape 
analysis. The contributions of different regions in the njets-Weff space to the binned log-likelihood automatically focus 
on the correct phase-space region and are readily available for improved analyses as well as theoretical interpretation. 

II. JET NUMBER SCALING 

To separate new physics events from a QCD sample after some very basic cuts we have to understand the number 
of jets and their energy or px spectra. This will allow us to exploit two distributions: the number of jets observed 
("■jets) and their effective mass (mcff), where the definition of the latter usually requires us to define the number and 
hardness of the jets included in its construction. Our maximally inclusive approach means that aside from the fiducial 
volume of the detectors all we fix is the algorithmic jet definition to count a jet towards each of the two measurements. 
Throughout this paper we define jets using the anti-fcr algorithm [HI] in Fast Jet [22 with a resolution Ranti-kr — 0-4 
and then require 

PT,j > Pt'"" = 50 GeV and \yj\ < 4.5 . (2) 

This defines which jets are counted towards njets as well as the mofi distribution. Given p™™ we can then evaluate 
the 2-dimensional njcts vs rrics plane in Section |V] using a binned log-likelihood approach. 

Before we can use the njets distribution to extract new physics in the jets plus missing energy sample at the LHC we 
need to show that we understand this distribution in detail. Obviously, the overall normalization of this distribution 
is not critical. For any kind of new physics not completely ruled out by the Tevatron experiments the two jet and 
three jet bins are practically signal free. So the question becomes: what can we say about the shape of dcr/dnjcts- 

For W-|-jets events this kind of distribution has been studied, even at the LHC [12] • We observe the staircase 
scaling [T71I13, an exponential drop in the inclusive njets rates with constant ratios an+i/d'n- The numerical value of 
this ratio is obviously strongly dependent on p^"". The original staircase scaling describes inclusive jet rates, i.e. it 
uses (T„ including all events with at least n jets fulfilling Eq.([2]). In the light of recent advances in QCD and because 
our likelihood analysis should be based on independent bins we define the scaling in terms of exclusive jet rates, i.e. 
counting only events with exactly n jets fulfilling Eq.([2]) towards cr„. This preserves the normalization of the njots 
histogram as crtot = X^n and makes it possible to add the bins in the computation of the log-likelihood. It is 
interesting to note that staircase scaling defined either way implies staircase scaling using the other definition, and 
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Figure 1; Exclusive da/drijots distribution for W+jets (left) and QCD jets production at the LHC. Only the jet cuts given in 
Eq.([2| are applied, neither a cut nor a lepton veto is imposed. The second panel shows the parametric uncertainty due to a 
consistent change of as{mz) between 0.114 and 0.122. The third panel shows the reach of a consistent scale factor treatment 
which can be experimentally determined and should not be considered a theory uncertainty. 



that the jet-production ratios of the two approaches are identical. If we define the universal exclusive staircase-scaling 
factor as 

R = R(n+l)/n = , (3) 

^ n 

we find for the usual inclusive scaling denoted by a hat over all parameters 

O'n (Tn+ (^n+lYl%i}R^ ( 1 - i? ) CTn + i?,(T„ 

The same relation we find when we include a finite upper limit to the number of jets in the sum over j . Note, however, 
that this argument only holds for a strict staircase scaling where the ratio R{n+\)/n does not depend on the number 
of jets n. For our analysis this means that we can use the staircase scaling for a statistical analysis of the da/drijcts 
distribution either in its inclusive or in its exclusive version, the latter based on independent njets bins. 

In this section we will show that (1) such a scaling exists not only for W/Z+^ets but also for pure QCD events 
and (2) we can reliably estimate the scaling factor and possible deviations from it from theory. A purely data-driven 
background analysis of this distribution might be possible and should be combined with our results. For example, we 
can one by one remove the missing energy cut and the lepton veto in Eq.Q which gives us background dominated 
event samples to a reasonably large number of hard jets. Adding the background rejection cuts will then have an 
impact on the scaling, which we can estimate reliably. For the signal hypothesis we have to entirely rely on QCD 
predictions. 

As a starting point we discuss the established staircase scaling in M^-|-jets production. The behavior of Z-f jets is 
exactly the same. For our analysis we produce CKKW-matched [TD] background samples for W^-|-jets (to 5 ME jets), 
Z+iets (to 5 ME jets), tt+jets (to 2 ME jets), and QCD jets (to 6 ME jets) with Sherpa-1.2.3 [H]. Higher order 
corrections to the inclusive scaling we expect to, if anything, improve the assumption of a constant jet ratio R for 
example in W^-|-jets production [24l . 

In the left panel of Figure [T] we show the exclusive njets distribution for the LHC running at 7 TeV center of mass 
energy. To increase our statistics to large enough values of njcts we do not apply the selection cuts Eq.(II]) in this first 
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step. We already see that we can qualitatively fit a line through the central points on a logarithmic axis for each set 
of input parameters. 

Before we quantitatively evaluate this scaling we need to consider the uncertainties associated with our simulation. 
This is crucial if we want to use the njots scaling as a background estimate for new physics searches in QCD final 
states. There are two distinct sources of uncertainty in our simulation. First, there exists a parametric uncertainty, 
namely the input value of as{mz) or some other reference scale. To address this, we consistently evaluate the parton 
densities around the central NLO value as{mz) = 0.118 inside a window 0.114 — 0.122 [5S] and keep this value for 
all other appearances of the strong coupling in our matrix-element plus parton-shower Monte-Carlo simulations. In 
Figure [l] we see that the resulting error bar on the da/dujcts increases with the number of jets, but stays below 30% 
even for the radiation of six jets. For luminosities around 1 fb~^ the error on Us is roughly of the same order as the 
experimental statistical error. Systematic errors we do not consider, even though they will at some point dominate 
over the statistical errors. After any kind of realistic background rejection the combined experimental error will exceed 
the parametric as uncertainty. 

The reason why we cannot use staircase scaling in W^-|-jets to measure as is a second source of QCD uncertainty: 
aside from the parametric as error band, an actually free parameter in our QCD simulation is a common scaling factor 
fi/fj-o in all appearances of the factorization and renormalization scales, including the starting scale of the parton 
shower. Identifying all scales follows the experimental extraction of the parton densities and as in a simultaneous fit. 
The interpretation of DGLAP splitting in terms of large logarithms tells us that the factorization and renormalization 
scales have to be identified with the transverse momentum of the radiated jets. By definition, such leading-logarithm 
considerations leave open the proportionality factor in the relation fj, cx \pT.j I ■ Any constant factor can be separated 
from the dangerous logarithm as a non-leading constant value. 

Because this constant cannot be derived from first principles we vary it in the range fJ-/no = 1/4 — 4 and show the 
numerical result in Figure [l] As expected, the variation of the jet rates with this scaling parameter is huge — much 
larger than the experimental uncertainties we expect from the LHC and which we know from the Tevatron. In Figure[T] 
we can first of all check that introducing such a scaling factor does not seriously impact the observed staircase scaling. 
Counting such a constant towards the theory uncertainty is questionable if we can determine it experimentally. For 
example for Sherpa we know from Tevatron that the scaling factor should essentially be unity [26] , which in the spirit 
of Monte-Carlo tuning means that for example in Sherpa the naive default parameter choice comes out as correctly 
describing the data. Of course, this does not have to be true for other simulation tools. An interesting question to ask 
once we have access to it at the LHC would be if this per se free parameter really is the same for different channels, 
like W/Z+iets and QCD jets. 

In the right panel of Figure[l]we show the same distributions for pure QCD jet production. Again, not applying the 
cuts in Eq.([T]) we observe staircase scaling, however, with some caveats for the two and three jet bins. This is related 
to the definition of the hard process. As expected, the scale factor ^J,/^J,Q has very large impact not on the existence of a 
staircase scaling but on the jet ratio R. The parametric uncertainty due to the error bar on as{mz) is again small once 
we vary the strong coupling consistently everywhere, staying below 30% for up to six jets. The parametric uncertainty 
for the pure QCD case and the VF-|-jets case is clearly very similar. The scale factor variation fi/fJ^o = 1/4 — 4 gives an 
even larger band of possible ratios of cross sections, to be contrasted with a reduced statistical uncertainty compared 
to the iy-|-jets case. Our argument that this over-all scale factor should be determined experimentally is therefore 
even more applicable for the QCD case. To date such an analysis does not exist, so while in the following we will use 
unity as the appropriate scale factor for Sherpa this needs to be verified experimentally. 

Once we understand the size of theory uncertainties for the exclusive da/dujcts distribution we need to quantify 
the quality of the observed staircase scaling. Since the quantitative outcome will depend on the background rejection 
cuts we apply, we study the scaling without the cuts shown in Eq.Q, after the lepton veto only, and including the 
lepton veto as well as the missing energy cut. Starting from the individual R(n+i)/7i values we fit a line through all 
relevant data points, as a function of rtjets 

dR 

Ririjcts) = Ra + -1 JT-jcts , (5) 

and determine the slope to compare it to our prediction dR/drijcts = 0. 

In Table |l] we list the exclusive jet ratios as shown in Figure I] For the M^/Z-|-jets case we see that the radiation of 
one compared to a second jet off the Drell-Yan process R2/1 does not show this scaling. The reason for this specific 
feature is the definition of the hard core process alluded to before. To generate the relatively hard jets and the large 
missing energy mimicking the signal events we need to at least consider W/Z+1 jet as the core process. In addition. 
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channel (cuts) 


R2/1 R3./2 Ri/3 -R5/4 Re/s R7/6 
Sherpa simulation 


dR 

linear fit 


W+jets {pT,j > 50 GeV) 
MZ+jets (+ lepton veto) 
H/ +jets pT > tUU LreV j 


0.1931(3) 0.1494(5) 0.157(1) 0.138(3) 0.115(8) 0.09(2) 
0.2290(4) 0.1494(7) 0.164(2) 0.139(4) 0.12(1) 0.09(2) 

^OKO/1^ noo/i/'o\ ninn/K\ nic/i\ ^1K/'o^ 
U.2o2(ij U.ZZ'l(Z) U.iyO(oj U.iD(ij U.it3(2j U.09(4j 


0.150(1) -0.001(1) 
0.149(1) —0.002(1) 

n OQ^;'o^ n noo/o^ 
0.2oy^^j — U.032^^J 


Z+jets (,Ptj > oU (jeV j 
Z+jets (+ iiT > 100 GeV) 


0.i4D3(2j U.ioU4(Dj 0.14/(^1) U.io8(4j U.i2^(9j 0.0(^2j 
0.2251(6) 0.185(1) 0.166(3) 0.154(6) 0.14(1) 0.08(3) 


0.1o4(lj — U.006(lj 

0.193(2) -0.018(2) 


QCD jets (pTj > 50 GeV) 


— 0.0552(1) 0.1074(5) 0.106(1) 0.125(5) 0.12(1) 


0.105(2) 0.001(1) 


(tt)ft,?i+jets (ptj > 50 GeV) 


3.69(9) 1.26(2) 0.67(1) 0.366(9) 0.24(1) 0.15(5) 




(ii)«/h+jets (pTj > 50 GeV) 
{tt)u/h+iets (+ lepton veto) 
(if)«/h+jets (+ > 100 GoV) 


1.96(2) 0.851(7) 0.465(5) 0.260(5) 0.168(8) 0.12(2) 
1.75(2) 0.765(10) 0.391(7) 0.228(8) 0.14(1) 0.12(3) 
1.60(5) 0.83(2) 0.49(2) 0.25(2) 0.15(2) 0.19(7) 





Table L Jet ratios for all Standard Model channels, including (semi-)leptonic and hadronic top pairs for the central scale choice 
fi — Ho- The quoted errors are statistical errors from the Monte Carlo simulation. The numbers correspond to the curves shown 
in Figures [1] and [2] 



we do not take into account any separation criterion between the first jet and the gauge boson, which means we treat 
CTi different from all other (t„. In Table |I] we see that we are lucky for the Z+jets case, but we are not for the W+jets 
case. The tricky definition of the hard process cri as the base of additional jet radiation suggests that we start our 
staircase scaling analysis with -R3/2. 

The statistical uncertainties which we show in Table |l] and which enter the fit of the slope as defined in Eq. ([s]) 
always increase towards larger jet multiplicity. This is an effect of the way we simulate these events which completely 
corresponds to an experimental analysis. If we generate (or measure) all rijots bins in parallel the first bin will always 
have by far the smallest error. Therefore, it determines the constant scaling factor Rq in our fit as well as in a 
measurement. For larger values of njots we become statistics dominated, which means that Monte Carlo simulations 
can extend the reach of actual measurements at any given point in time towards larger jet multiplicities. This is the 
phase space region in which we need to provide new physics searches at the LHC with accurate background estimates. 

Some of the rows listed in Table |l] we also depict in Figure [2j For electroweak gauge boson production we see that 
without any cuts W and Z production show the same scaling parameter Rq as well as a small negative slope. Within 
errors the staircase scaling holds to six and possibly seven jets, even though we see a slight slope developing towards 
larger numbers of jets. This is a phase space effect which is expected once we start probing gluon parton densities 
and their sharp drop towards larger parton momentum fractions and which is well modeled by our computation. 

Adding the lepton veto does not change the staircase scaling at all. This means that forcing the W boson to 
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Figure 2: Jet ratios for Z+jets production (without and with the /^t > 100 GeV cut) and QCD jets production, corresponding 
to the numbers listed in Table |l] The error bars indicate the remaining Monte Carlo uncertainty in our simulation. 
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decay into one fairly soft lepton and a harder neutrino does not affect the behavior of the recoiling jets. Adding a 
significant cut, on the other hand, has a measurable effect on the jet ratios as well as on the slope. For experimental 
applications of this scaling, however, it is important to note that the phase space effects for large n-^cts as well as the 
effect of kinematic cuts are completely described by our simulation. 

For pure QCD events we find a remarkable agreement with the staircase scaling hypothesis, which seems to be 
supported by recent LHC analyses |27j . The definition of the hard core process is somewhat problematic since there 
exists no inherent hard scale in the 2 — > 2 process and the infrared behavior of s-channel and t-channel diagrams 
is very different. Therefore, we define tia as the starting point of our analysis and -R4/3 as the first relevant cross 
section ratio. Table |l] and Figure [2] show that the ratios -R(,i+i)/„ are essentially constant to eight jets. The slope 
within statistical uncertainties is, m contrast to W jZ production, fully compatible with zero. The central i?o values 
for M^/Z+jets and QCD jets production are slightly different, which is expected by the different core processes and 
by the different background rejection cuts. 

III. DECAY JETS VS JET RADIATION 

In contrast to the QCD and gauge-boson background njcts distributions from heavy particles decaying to jets 
include two sources of jets: first, there are decay jets, which dependent on the spectrum might or might not be 
hard enough to stick out. Second, there is QCD jet radiation, which for heavy states will generically be relatively 
hard and dominated by coUinear splitting in the initial state j281 129j , leading to a non-zero maximum value of the 
number of expected initial-state radiation jets [HIS^. Due to the hard scale of new-physics processes on the one hand 
and because we need to simulate supersymmetric decays inclusively we best generate the new-physics events with 
Herwig-|--|— v2.4.2 [3T] and normalize the cross sections with PR0SPIN02.1 All supersymmetric mass spectra we 
generate with SoftSusy using the SLHA output format jMj and use Sdecay [3S] to calculate the leading-order 
branching ratios. We check the jet-radiation results from the Herwig-|--|- shower with Mlm merging implemented in 
MadEvent [53], using Pythia [3B] for parton showering and hadronization. As expected, the two simulations agree 
well within their uncertainties. 

The question for heavy-particle production is how universal its njets distributions are when we consider Standard 
Model as well as new-physics particles with different masses and color charges, like top quarks, squarks and gluinos. 
In Figure |3] we first show the njots distributions for (semi-)leptonic and hadronic top-pair production. We see how 
all unsubtracted distributions show maxima away from njcts = 0, driven by the presence of decay jets plus relatively 
hard jet radiation. In addition, they do not show a staircase scaling at large jet multiplicities. Because the particles 
produced in the hard process have non-negligible masses even compared to the hadronic center-of-mass energy the 
phase-space suppression for example due to rapidly dropping gluon densities kicks in immediately and bends the 
otherwise exponential fall-off. 
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Figure 3: Normalized exclusive da/dujcts distributions for top pairs (left) and supersymmetric particle production. For the 
latter we show all decay jets plus QCD jet radiation (center) as well as QCD jet radiation only (right). Jets are counted once 
they fuIfiU Eq.Q. 
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{tt)hh {tt)u/h 


qq qg gg | SUSY 


qq qg gg 




full 


full 


jet radiation 


ao 

Ol 

b 


3.13 2.34 
5.41 3.73 
1.25 1.07 


2.89 3.53 4.16 
5.28 6.16 7.15 
1.71 1.25 1.09 


3.15 
5.48 
1.27 


n.a. n.a. n.a 
0.45 0.36 0.21 
1.14 1.07 0.98 



Table II: Parameters defined in Eq.([6| and extracted from the unsubtracted distributions shown in Figure [3] The parameter 
ao corresponds to the position of the maximum while b captures the approximate scaling at larger njots. The combined 
supersymmetric result is based on the appropriately weighted event samples for squarks and gluinos. 



In the Standard Model we can fit the (semi-)leptonic and purely hadronic top-pair distributions for all jets fulfilling 
Eq.([2]) to the function 



dlogcr(njets) _ , "fots - «l"-jots 



drii 



'jets ^jcts 



(6) 



The two relevant fit parameters for the normalized distributions shown in Figure [3] correspond to the maximum at 
njots = ciQ, and the (staircase) scaling parameter for QCD jet radiation at large njcts given by R = exp(— 6). Because 
we do not include higher suppression terms towards large rtjets we stop the fit at the endpoints of the curves shown 
in Figure [3 

In Table 11^ we list the best fit values for these parameters for both top decays. We immediately see more quanti- 
tatively than in Figure |3] that for example hadronically decaying top pairs on average include not even one more jet 
than the (semi-)leptonic sample. Typically only one of two jets from the W decay is accounted for because of the 
cutoff at p™'" = 50 GeV. Comparing this value to the W mass it is likely that one of the two W decay jets gets 
boosted above p™'", but the other one stays below. In contrast, the Jacobian peak of the 6-quark energy from the top 
decay lies above p™'". Going back to Table jn] this means that for top pairs the most likely number of radiated jets is 
zero, closely followed by one jet emission |30j . 



For squarks and gluinos the features we see in top-pair production become more pronounced and the jet multiplicity 
reflects the color charge of the produced particles. As a reference point in supersymmetric parameter space we consider 
reasonably low mass gluinos and squarks in the SPSla benchmark scenario [37], with nig — 608 GeV and typical 
light-flavor squarks around rriq ~ 558 GeV. The new LHC exclusion limits are right at the edge of excluding this 
standard parameter choice^ . Because the gluino cannot decay to a gluon it requires two quarks to get rid of its color 
charge. Squark pairs, including squark-antisquark production, predict two hard decay jets plus some QCD radiation 
and sub-leading decay jets. In Table |TT] we see that for this production channel the maximum of a continuous njets 
distribution indeed resides almost at njots = 3. For associated squark-gluino and gluino-pair production the number 
of jets increases by almost one, corresponding to the second gluino-decay jet which not in all cases is hard enough 
to appear after requiring p^'" — 50 GeV. The jet multiplicity of the entire supersymmetric sample is close to the 
average for squark pair production and squark-gluino production which reflects the hierarchy in cross sections of the 
three processes [52] . 

Breaking down the supersymmetric signal into individual production processes we can examine the distinct radiation 
patterns. Gluino pairs radiate significantly more than associated production or squark pairs, which is reflected in the 
right columns of Table [ll} b{gg) < b{qg) < b{qq). The scaling parameter R = exp(— 6) is consistently larger than for 
the background samples in Table |lj For example for the jet radiation off squark pair production we find R w 0.32. 
Moreover, in Figure [3] we see that the jet rates for QCD radiation drop off even faster for large multiplicities. This 
means that there definitely does not exist any staircase scaling behavior for heavy particle pair production above a 
threshold of 1 TeV at the LHC with a hadronic center-of-mass energy of 7 TeV. This phase space argument should 
not be mixed with the fact that the hard scale of such processes and with it the logarithmic enhancement for collinear 
radiation is large, i.e. the validity of the collinear approximation extends to larger values of ptj- 

Finally, in Figure |4] we show the njcts distribution for the supersymmetric signal assuming the SPSla parameter 
point and the various Standard Model backgrounds. We apply the background rejection cuts specified in Eqs.Q and 



^Due to the presentation of the LHC results in the mo vs mi/2 plane it is also not possible to precisely read off the actual limits in terms 
of physics mass parameters. Moreover, since squark and gluino masses are both mostly driven by mi/2i there does not exist a mapping 
of the mo-Tni/2 plane into the squark-gluino mass plane. Models with significantly heavier gluinos than quarks are excluded in CMSSM 
searches. 
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Figure 4: Exclusive njots distribution for all considered Standard-Model backgrounds and the SPSla signal for supersymmetry. 
We present the results for an LHC center-of-mass energy of 7 TeV with an integrated luminosity of 1 fb~^ and after the cuts 
specified in Eqs. ([T| and ([2|. 

([2]). The variation in shape when including the signal events is statistically significant and appears as an excess of 
high jet-multiplicity events for njcts > 5. The associated statistical significance we compute in Section [V] 

IV. EFFECTIVE MASS 

Before we turn to exploit the number of jets to extract a new physics signal at the LHC an obvious question is if 
we can make use of our understanding of the njets distribution looking at other observables in multi-jet final states. 
More specifically, we will use the measured scale parameter ^/^.q shown in Figure [T] to reliably predict observables, 
which, based on traditional QCD simulations, show an overwhelming theory uncertainty. A classic observable in this 
respect is the effective mass [2D], which for exclusive jet multiplicities we define as 

mcff ^i)T+ ^ PT,j , (7) 

all jets 

including all jets fulfilling Eq.([2]). This definition is neither optimized to take into account a correlation between hard 
jets and the missing-energy vector nor to remove hard initial-state radiation. Instead, Eq.([7]) makes a minimal set of 
assumptions to avoid sculpting the background distribution. 

Just like the njets distribution rUeg in the Standard Model cannot be reliably predicted by parton-shower Monte 
Carlos. Jets entering the sum in Eq.([7| we have to understand over their entire transverse-momentum spectrum. 
Ckkw [9l [To] or Mlm [TT] matching is therefore the most adequate approach for simulating rrics- 

Exactly following the treatment of the njots distribution in SectionllTJwe estimate two sources of theory uncertainties, 
the parametric error from varying the strong coupling and the scale-variation systematics. To not be limited by 
statistics of our background samples we for now discard the missing energy cut and the lepton veto and instead study 
the fully inclusive processes. In Figure [5] we present the mcff distribution for M^-l-jets and the QCD jets production 
with rijots > 2. The same way as in Figure IT] we show the relative impact of the two sources of uncertainty in the 
lower panels. The parametric error from as(mz) ranges well below 20% even towards large values of mcff. For the 
electroweak process this is of similar size to the expected statistical error for an integrated luminosity of 1 fb^^. As 
expected, towards large rricff the error band increases, but not dramatically. 

In contrast, the scale-factor variation fi/fJ-o = 1/4 — 4 has a huge effect on the rricff simulation, essentially rendering 
it unpredictive. For values above moff — 500 GeV the error bands become large enough to make it impossible to 
extract new physics from this observable, were we to consider the scale variation a proper theory error. However, 
measurements of multi-jet rates and other jet observables at Tevatron and LHC indicate that for the case of Sherpa 
this scaling factor is approximately one |26| . Measuring the staircase-scaling factors even more precisely with the 2011 
LHC data will further constrain the scale ambiguities underlying our QCD simulations - allowing us to make reliable 
predictions for e.g. the TOoff observable. 
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Figure 5: Effective mass distribution for M^+jets and QCD jets production. Only the jet cuts given in Eq.([2| are applied. 
The second panels show the parametric uncertainty due to a consistent change of as(mz) between 0.114 and 0.122. The third 
panels show a consistent scale factor variation which can be experimentally constrained and should not be considered a theory 
uncertainty. 



To see the impact of rrics in searches for supersymmetry we show the rrios distribution for exclusive 2-jet and 3-jet 
events in Figure |6] It includes Standard-Model backgrounds as well as the supersymmetric signal. All jet-selection 
and background-rejection cuts specified in Eqs.Q and ^ are applied. As mentioned before, QCD jets are the by far 
dominant channel. Only for TOoff > 800 GeV the signal starts overcoming the backgrounds. The statistical uncertainty 
for 1 fb^^ we indicate by the shaded regions in the lower panels. It is worth noticing that the signal-|-background 
sample when compared to the pure background sample exhibits a maximum at around nics ^ 1.1 TeV. This scale 
corresponds to the squark and gluino masses which for pair production add to 1100 to 1200 GeV. This means that the 
TOeff distribution for exclusive jet multiplicities can serve as background rejection as well as a measure for the mass 
scale of the new heavy colored states. 




Figure 6: Effective mass distribution for exclusive 2-jet and 3-jet events for Standard-Model backgrounds and the supersym- 
metric signal using the SPSla parameter point. We assume a center-of-mass energy of 7 TeV with an integrated luminosity of 
1 fb~^ and apply all cuts in Eqs.([l]) and 
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signal significance 






for 35 pb"^ 


inclusive 


0.2 a 


rijets (ID) 




1.6(7 


TOefl (ID) 




3.3(7 


(rijots, JTlcff-^ 


(2D) 


4.6(7 



Table III: Confidence levels for the signal plus background sample ruling out the background-only hypothesis based on one and 
two dimensional log-likelihood distributions. The supersymmetric mass spectrum is given by SPSla. 



V. AUTOFOCUSING 



Following our results in the previous sections we should be able to use the shapes of the njots and rrioff distributions 
to extract a supersymmetric signal from the now quantitatively understood Standard Model backgrounds. Given 
that the two distributions are affected independently by the color structure of the new physics sector and by its 
mass scale(s) we will assess the power of the two-dimensional njcts vs rrioff correlations in extracting a discovery or an 
exclusion. Such a two-dimensional shape analysis is the natural second step after the first completely inclusive searches 
based on counting events. According to Sections [II|IV| systematic experimental uncertainties will start dominating for 
luminosities around 0(1 fb~^). Since those are subject to continuous refinement during data taking and need to be 
addressed within a full detector simulation study we limit ourselves to statistical uncertainties for a given luminosity. 
While this means that we will not obtain reliable estimates for the discovery reach, we will see that it allows us to 
discuss the main benefits and limits of the proposed analysis. 

As supersymmetric reference models we choose the benchmark point SPSla, two variations of it, and SPS4. Again, 
we only apply the cuts given in Eqs.Q and ^ and use the exclusive definition of njots and moff. For the rrioff 
distribution we choose a binning of 100 ueV, which approximately reflects the experimental resolution towards large 

TOoff. 

For given background and signal -f background hypotheses we use a binned log-likelihood ratio to compute statistical 
significances assuming statistically uncorrelated bins 



bins 



nt log ( 1 



(8) 



It includes the luminosity via the signal and background event numbers Si and bi in each bin. While it avoids the 
limitations of S/\/B in regions requiring Poisson statistics it approaches a Gaussian limit for each individual channel 
when the bin content becomes large. Some features of this well established approach we summarize in Appendix [A] 
Applying a "simple hypothesis test" tells us how likely it is that the background-only hypothesis fakes the predicted 
signal-fbackground distributions as a statistical fluctuation, i.e. we define the p-value as the SPSla likelihood ratio's 
median. The likelihood ratio given in Eq.(j8]) we compute for the exclusive njets, Woff, and two-dimensional (njots, Wcff) 
distributions. In this two-dimensional plane the definition of rrioff, following Eq.([7]), only includes exactly njots jets. 
With this completely exclusive definition of rijots and moff we ensure that the sum over all bins in the (rijotsj^off) 
reproduces the total cross section. 



Considering this correlation is similar in spirit to the (^t, Ht) analysis proposed in Ref. (38j. However, first we focus 
on the Tijets and rriefr distributions because in Sections 
the staircase scaling behavior of the Standard Model backgrounds and translate its precision into other variables. In 
addition, as we will see in this section these two variables play a special role, as they not only distinguish signals 
from backgrounds, but also contain information on the structure of the underlying new-physics model. As mentioned 
above, for the sake of a proof of concept we ignore all uncertainties except for statistical experimental errors, to avoid 
correlations in the definition of the log-likelihood. 



npV we have shown that we can quantitatively understand 



We can expect from Figures ffl and m that the rate in each individual njots bin is dominated by Standard-Model 
processes at low moff- Most likely, this region will be the control region to normalize the QCD and M^/Z-|-jets 
backgrounds. With the exception of hadronically decaying top pairs all Standard-Model channels will then show 
a simple decrease in both directions of the two-dimensional (njots , 'm-cs) plane which we can predict following the 
arguments in Sections [Tl|lV[ The signal contribution will become visible only once rnes reaches the mass range of the 
particles produced. 
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In Table III we compare the statistical significances for the supersymmetric SPSla parameter point at 7 TeV 
center-of-mass energy for the various analysis strategies: first, we show the results based on the total production rates 
after the inclusive cuts of Eqs.([T) and ([2]). As expected, including the signal events leaves us completely consistent 
with the background-only hypothesis. Next, the likelihood ratio computed from the njets distribution gives rise to 
sizable deviations from the background for integrated luminosities as small as 35 pb~^. The one-dimensional rricff 
distribution turns out to be an even better discriminator. It gives us more than twice the njots significance, namely 3.3 a 
for £ = 35 pb~^. The highest significant discriminative power we obtain for the two-dimensional binned (jT-jotsj ™off) 
case. This is a direct consequence of the additive binned log-likelihood given in Eq.(|8]). 

Beyond the relevance of the {rijetsi'm-es) distributions to extract new particles from backgrounds, we can utilize it 
to study signal properties. Above, we argue that new physics contributions to njcts will only appear once rUeS reaches 
the mass scale of the sum of both heavy particles produced. However, this only happens if the exclusive njcts value 
allows us to include the decay jets contributing to moff. Hence, the new physics contributions to the two observables 
will show a correlation based on the mass and decay channels of the new particles produced. The decay channels can 
typically be linked to the color charge of the new particles if we assume that the missing energy particle cannot carry 
color charge. Color triplets will tend to decay to one hard quark jet while color octets with their diagonal coupling to 
gluons will radiate two quark jets. This means breaking down the binned log-likelihood ratio over the fully exclusive 
(njotsj^off) plane and keeping track of the individual contribution of each bin will automatically focus our search on 
the appropriate properties of the particles we are looking for. 

This statement is not limited to supersymmetry, the SPSla parameter point or any other assumption about the 
signal. It can be applied to general physics beyond the Standard Model with strongly interacting new particles and 
a stable dark matter candidate. In Figure [7] we show the contributions of the individual bins to the summed log- 
likelihood ratio for all signal events combined and split into three production processes. The maximum significance 
automatically reflects SPSla's decay paradigm gg*^*) — > 2 jets and g^*\g 3 jets, and gg 4 jets, know already 
from Figure The first two channels we can study using an integrated luminosity of 1 fb~^. Squark pair production 
is dominant because at the LHC it includes a quark-quark initial state. Associated production, which often is the 
dominant channel at the LHC, has a comparable statistical yield and features a slightly higher rrics range. Both 
channels combined define the diagonal correlation we see for the combined signal events. 

Gluino pair production has the smallest production rate and therefore becomes subleading in the combined su- 
persymmetry sample. However, for this channel we can best follow the imprint of higher jet multiplicities. Due to 
its large mass and its color charge gluino pairs produce significantly more jet radiation which we can resolve for a 
sufficiently low p™'" threshold. For p™™ = 50 GeV we might just capture the first decay jet from the gluino cascade, 
refiecting the mass hierarchy nig — rriq ^ 60 GeV. The peak in the log-likelihood plane around Tijots = 4 results from 
the maximum in the gg production cross section. For njots = 5 the background is still large compared to the signal, 
but dropping at an exponential rate it gets surpassed for njcts = 6, explaining the structure we observe in Figure [7j 

Finally, we can study how changes to the new physics spectrum are refiected in the significances computed from the 
binned log-likelihood (5(njets, Woff)- We investigate three different supersymmetric mass spectra : first, we increase 
only the gluino mass by 150 GeV with respect to SPSla (crgusy = 2-69 pb according to Prospino2.1 [32]); second, 
we increase all colored-sparticle masses by 100 GeV with respect to SPSla (crgyg^ = 1.63 pb); third, we consider the 
SPS4 benchmark [37' with an inverted mass hierarchy - 750 GeV > mg - 730 GeV (crguSY = 0.83 pb). AU of 
these cross sections are significantly smaller than for SPSla with its crgusY ~ 4.68 pb, which means we increase our 
nominal luminosity to £ = 5 fb^^. 



SPS1 a, 1 inverse fb SPS1 a, squarks+squarks, 1 inv. fb SPS1 a, squarks+gluino, 1 inv. fb SPS1 a, gluino+gluino, 1 inv. fb 

m.(i(TeV) fTTotfCTeV) niojCTeV) me,(TeV) 




Figure 7: Log-likelihood contributions over the (njots, ?Tioff) plane for the supersymmetric signal using the SPSla spectrum. 
The color code is normalized to different maximum significances. 
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SPS1a, gluino+150 GeV, 1 inv. fb 
mon(TeV) 




SPS1 a, g+1 50 GeV, sq+g, 1 inv. fb 

moii(TeV) 




SPS1 a, spartons+1 00 GeV, 5 inv fb 
moti(TeV) 




SPS4, 5 inv. fb 

mjnCTeV) 




Figure 8: Log-likelihood distributions over the (njcts, JtIcEf) plane for the different supersymmetric spectra. Due to the smaller 
signal rates we present results for an integrated luminosity of 5 fb~^. 

In Figure [8] we clearly see the effect of the increased gluino mass. The mog peak for associated squark-gluino 
production moves to larger values, as does the rijots maximum. However, because the balance between squark pair 
production and associated squark-gluino production shifts into the direction of the squark pairs, this effect is not quite 
as pronounced. The second scenario with increased squark and gluino masses leads to a pronounced maximum at 
larger rrics. Due to the smaller signal cross section the sensitivity in particular in the njots = 2 bins gets considerably 
diminished, appearing as a shift towards higher njots values. For the all-hadronic search in the SPS4 parameter point 
longer decay chains for gluinos through bottom squarks appear in the high njcts bins only. 

The SPS4 case illustrates that njots — 6 does not have to be the maximum jet multiplicity we need to consider. 
Once we rely on a combination of data and Monte Carlo methods to describe the njets staircase scaling for background 
processes we can extend our analyses to very large jet multiplicities. On the other hand. Figure [7] also clearly indicates 
that for example in the SPSla parameter point the optimal signal extraction strategy by no means requires us to go 
to very large jet multiplicities. For the SPSla parameter point the two-jet bins are leading contributions to the total 
significance. 

VI. OUTLOOK 

Multi-jet events with and without a weak gauge boson are the dominant backgrounds to inclusive searches for 
example for supersymmetry. Simulating them with traditional parton shower Monte Carlos does not lead to reliable 
results. This changes when we make use of jet merging to predict the shapes of multi-jet backgrounds. 

One of the most challenging distributions to describe theoretically is the inclusive or exclusive number of jets per 
event. From data we know that the inclusive W/Z +iets distributions without rigid cuts follow a staircase scaling 
with constant jet ratios R — Un+i/cfn- We have shown that staircase scaling in the inclusive jet rates is equivalent to 
the same scaling in exclusive rates. Nowadays, the exclusive scaling we can compare to the predictions of jet merging 
Monte Carlos, like Sherpa. Moreover, we have seen that QCD jet production shows an even more pronounced scaling 
than W/Z+]eis production to jet multiplicities of njets = 8. 

We studied the validity of the staircase scaling behavior, the theory uncertainties in the njets distributions, its link 
to other multi-jet observables, and the application of jet-exclusive observables to new physics searches and found: 

1. While we cannot derive the staircase scaling of the njots distribution from first principles we can reproduce it 
using the appropriate Monte Carlo tools. This includes the scaling feature itself, a careful error analysis, and 
the scaling violation effects towards large values of njots due to phase space restrictions. 

2. The theory uncertainty on the staircase scaling consists of tunable parameters like scale factors in the factoriza- 
tion and renormalization scales and on parametric errors like the dependence on a^. The latter are small. The 
scale factor hugely overestimates the error and should be thought of as a tuning parameter for the different jet 
merging implementations. For Sherpa it comes out close to unity. 

3. The scaling parameter R — (T„+i/cr„ depends on the hard process and on kinematic cuts. Both effects we can 
reliably predict using Monte Carlos, as we have shown for the W/Z+^ets and pure jets cases. The 7-l-jets case 
we postpone to a later study with more specific details for the LHC experiments [55] . 
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4. These simulations of the staircase scaUng in muhi-jet processes can be easily combined with data driven tech- 
niques, giving us the over-all normalization and a cross check for the first njets bins. Statistically limited regions 
of phase space will become accessible via simulations, including a reliable error estimate. 

5. Understanding staircase scaling of multi-jet processes allows us to predict other multi-jet variables, like the 
effective mass rricff- Again, this includes a proper treatment of theory uncertainties. In addition, the completely 
inclusive definition of rrios removes dangerous artifacts due to the usual truncations. 

6. Based on for example the njots vs rrioe correlation for a fixed p™™ we can define a likelihood-based analysis 
avoiding model or spectrum specific background rejection cuts. Such shape analyses in multi-jet search channels 
are the natural extension of the early inclusive ATLAS and CMS searches. 

Of course this simple first approximation to the exclusive zero-lepton search for jets plus missing energy is not the 
only application of such methods. Searches including leptons, b tags, or hard photons will benefit from the same 
treatment, as long as they include non-negligible numbers of jets. The same is true for hadronically decaying top 
quarks in the Standard Model. 
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Appendix A: Hypothesis test 



In this section we briefly review the binned log-likelihood ratio hypothesis tests which we apply in Section [Vj It 
discriminates between two specific hypotheses and has been used for the combined LEP-Higgs limits [40j, Tevatron 
analyses [41] , and in various contexts of LHC Higgs phenomenology [42] . According to the Neyman-Pearson lemma [43] 
the likelihood ratio is the most powerful test statistic (e.g. signalH-background vs background-only). We compute 
the (binned) log-likelihood ratio 



-21ogQ ^ -2 log 



L{data,\S + B) 
L(data I B) 



n log ( 1 + - 



binned , 



E 

-iGbins 



ri j log ( 1 + — 



(Al) 



where s and b denote the signal S and background B event numbers for a given luminosity, split into the bins i. 
The probabilities to observe n events given the expected numbers s and b in Eq.(Al| are determined by Poisson 




n,^„, L=35pb ', 1.6c 

background only 

signal+background 



5 10 

-2 log Q 



Figure 9: Log-likelihood ratio distributions based on the njots discriminator for a luminosity of 35 pb ^. The confidence level 
is computed by evaluating the overlap of the background-only distribution with the signal+background maximum. 
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distributions 



(6 + s)"e-(*+^) 6^ 



L(data \S + B)= ' ' , L(data | B) = — . (A2) 



The sum in Eq.(Al) extends over all contributing channels. The likelihood distributions we generate as pseudo-data 
around each hypothesis' central value, which means that in principle we can include any kind of correlation. In this 
work we limit ourselves to statistically independent bins i of the njcts and m^g distributions. The set of entries in 
each bin {n^} we simulate numerically and histogram them as a function of Q, following the Neyman-Pearson lemma. 
To simulate the log- likelihood distributions we need to specify which hypothesis the bin entries {ui} should follow, 
i.e. we can compute Qs+b or Qg. In Figure |9] we show both Q distributions for the binned one-dimensional njots 
distribution studied in our paper. 

In our analysis we are interested in the probability that the background alone fakes the expected signal-|-background 
distributions. This confidence level is given by the integral of the background distribution over the sig- 
nal+background range, indicated by the red-shaded region in Figure [9j This signal-|-background range is defined 
as all likelihood values above the median of the likelihood distribution assuming the signal-|-background hypothesis 

/{Qs+b) / Z \ 

QQe=crfcf^j , (A3) 

where for illustration purposes we convert the confidence levels into the Gaussian number of standard deviations Z 
via the inverse error function. 
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